
    Joint Alignment and Modeling of Correlated Behavior Streams

    Get PDF
    The Variable Time-Shift Hidden Markov Model (VTS-HMM) is proposed for learning and modeling pairs of correlated streams. Unlike previous coupled models for time series, the VTS-HMM accounts for varying time shifts between correlated events in pairs of streams having different properties. The VTS-HMM is learned on a set of pairs of unaligned streams and, thus, learning entails simultaneous estimation of the varying time shifts and of the parameters of the model. The formulation is demonstrated in the analysis of videos of dyadic social interactions between children and adults in the Multimodal Dyadic Behavior Dataset (MMDB). In dyadic social interactions, an agent starts an interaction with one or more "initiating behaviors" that elicit one or more "responding behaviors" from the partner within a temporal window. The proposed VTS-HMM explicitly accounts for varying time shifts between initiating and responding behaviors in these behavior streams. The experiments confirm that modeling these varying time shifts in the VTS-HMM can yield improved estimation of the level of engagement of the child and adult and more accurate discrimination among complex activities.
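    The alignment subproblem can be illustrated with a much simpler stand-in: choosing the lag that best overlaps an initiating and a responding binary event stream. The `estimate_time_shift` helper below is hypothetical and only a sketch; the actual VTS-HMM estimates the varying shifts jointly with the model parameters rather than by exhaustive overlap scoring.

```python
import numpy as np

def estimate_time_shift(initiating, responding, max_shift=10):
    """Pick the lag (in frames) that maximizes overlap between an
    initiating and a responding binary event stream.
    Illustrative only: the VTS-HMM estimates per-event shifts jointly
    with the HMM parameters, not a single global lag."""
    best_shift, best_score = 0, -np.inf
    for s in range(max_shift + 1):
        # responding events shifted back by s should line up with initiating
        score = np.sum(initiating[:len(initiating) - s] * responding[s:])
        if score > best_score:
            best_shift, best_score = s, score
    return best_shift
```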

    Gesture Modeling by Hanklet-based Hidden Markov Model

    Get PDF
    In this paper we propose a novel approach for gesture modeling. We aim at decomposing a gesture into sub-trajectories that are the output of a sequence of atomic linear time invariant (LTI) systems, and we use a Hidden Markov Model to model the transitions from one LTI system to another. For this purpose, we represent the human body motion in a temporal window as a set of body joint trajectories that we assume are the output of an LTI system. We describe the set of trajectories in a temporal window by the corresponding Hankel matrix (Hanklet), which embeds the observability matrix of the LTI system that produced it. We train a set of HMMs (one for each gesture class) with a discriminative approach. To account for the sharing of body motion templates we allow the HMMs to share the same state space. We demonstrate by means of experiments on two publicly available datasets that, even when considering only the trajectories of the 3D joints, our method achieves state-of-the-art accuracy while competing well with methods that employ more complex models and feature representations.
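    The Hankel-matrix representation at the core of the Hanklet idea is easy to sketch: stack shifted copies of a joint trajectory as rows, so that each column is a short window of the signal. This is a generic construction, not the paper's code; for a trajectory produced by a low-order LTI system, the resulting matrix is low-rank, which is what makes it a useful descriptor.

```python
import numpy as np

def hanklet(trajectory, num_block_rows):
    """Build the Hankel matrix of a 1-D joint trajectory.
    Column j holds samples j .. j + num_block_rows - 1, so the matrix
    embeds the observability structure of the LTI system that
    generated the signal (generic sketch, not the paper's code)."""
    T = len(trajectory)
    cols = T - num_block_rows + 1
    H = np.empty((num_block_rows, cols))
    for i in range(num_block_rows):
        H[i, :] = trajectory[i:i + cols]
    return H
```

    For a first-order LTI output such as a geometric sequence, the Hankel matrix has rank one regardless of its size, illustrating why its rank (or a similarity between Hanklets) characterizes the underlying dynamics.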

    A unified framework for domain adaptive pose estimation

    Full text link
    While pose estimation is an important computer vision task, it requires expensive annotation and suffers from domain shift. In this paper, we investigate the problem of domain adaptive 2D pose estimation that transfers knowledge learned on a synthetic source domain to a target domain without supervision. While several domain adaptive pose estimation models have been proposed recently, they are not generic but only focus on either human pose or animal pose estimation, and thus their effectiveness is somewhat limited to specific scenarios. In this work, we propose a unified framework that generalizes well on various domain adaptive pose estimation problems. We propose to align representations using both input-level and output-level cues (pixels and pose labels, respectively), which facilitates the knowledge transfer from the source domain to the unlabeled target domain. Our experiments show that our method achieves state-of-the-art performance under various domain shifts. Our method outperforms existing baselines on human pose estimation by up to 4.5 percentage points (pp), hand pose estimation by up to 7.4 pp, and animal pose estimation by up to 4.8 pp for dogs and 3.3 pp for sheep. These results suggest that our method is able to mitigate domain shift on diverse tasks and even unseen domains and objects (e.g., trained on horse and tested on dog). Our code will be publicly available at: https://github.com/VisionLearningGroup/UDA_PoseEstimation.
    N00014-19-1-2571 - Department of Defense/ONR
    https://doi.org/10.1007/978-3-031-19827-4_35
    First author draft
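    The general idea of aligning source and target representations can be sketched with the simplest possible discrepancy: the squared distance between the mean features of the two domains. This `linear_mmd` function is a generic stand-in of my own, not the paper's objective; the proposed framework instead aligns at the input level (pixels) and the output level (pose labels).

```python
import numpy as np

def linear_mmd(source_feats, target_feats):
    """Squared distance between source and target feature means:
    the simplest alignment discrepancy one could minimize to pull
    two domains together. Generic illustration, not the paper's loss."""
    return float(np.sum((source_feats.mean(0) - target_feats.mean(0)) ** 2))
```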

    Memetic electromagnetism algorithm for surface reconstruction with rational bivariate Bernstein basis functions

    Get PDF
    Surface reconstruction is a very important issue with outstanding applications in fields such as medical imaging (computer tomography, magnetic resonance), biomedical engineering (customized prosthesis and medical implants), computer-aided design and manufacturing (reverse engineering for the automotive, aerospace and shipbuilding industries), rapid prototyping (scale models of physical parts from CAD data), computer animation and film industry (motion capture, character modeling), archaeology (digital representation and storage of archaeological sites and assets), virtual/augmented reality, and many others. In this paper we address the surface reconstruction problem by using rational BĂ©zier surfaces. This problem is by far more complex than the case for curves we solved in a previous paper. In addition, we deal with data points subjected to measurement noise and irregular sampling, replicating the usual conditions of real-world applications. Our method is based on a memetic approach combining a powerful metaheuristic method for global optimization (the electromagnetism algorithm) with a local search method. This method is applied to a benchmark of five illustrative examples exhibiting challenging features. Our experimental results show that the method performs very well, and it can recover the underlying shape of surfaces with very good accuracy.
    This research is kindly supported by the Computer Science National Program of the Spanish Ministry of Economy and Competitiveness, Project #TIN2012-30768, Toho University, and the University of Cantabria. The authors are particularly grateful to the Department of Information Science of Toho University for all the facilities given to carry out this work. We also thank the Editor and the two anonymous reviewers who helped us to improve our paper with several constructive comments and suggestions.
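    The memetic pattern itself, a population-based global search whose incumbent is refined by a local search each iteration, can be shown in a toy form. This `memetic_minimize` sketch uses plain random sampling for the global phase and a coordinate hill-climb for the local phase; the paper instead couples the electromagnetism metaheuristic with a dedicated local search, and optimizes the parameters of a rational BĂ©zier fit rather than a test function.

```python
import random

def memetic_minimize(f, dim, bounds, pop=20, iters=50, step=0.05, seed=0):
    """Toy memetic loop: random global sampling plus a coordinate-wise
    local polish of the best candidate found so far. Illustrative of
    the global+local structure only, not the paper's algorithm."""
    rng = random.Random(seed)
    lo, hi = bounds
    best = [rng.uniform(lo, hi) for _ in range(dim)]
    for _ in range(iters):
        # global phase: sample fresh candidates across the search space
        for _ in range(pop):
            cand = [rng.uniform(lo, hi) for _ in range(dim)]
            if f(cand) < f(best):
                best = cand
        # local phase: small coordinate steps around the incumbent
        for i in range(dim):
            for delta in (-step, step):
                trial = list(best)
                trial[i] += delta
                if f(trial) < f(best):
                    best = trial
    return best
```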

    Modal Matching for Correspondence and Recognition

    No full text
    Modal matching is a new method for establishing correspondences and computing canonical descriptions. The method is based on the idea of describing objects in terms of generalized symmetries, as defined by each object's eigenmodes. The resulting modal description is used for object recognition and categorization, where shape similarities are expressed as the amounts of modal deformation energy needed to align the two objects. In general, modes provide a global-to-local ordering of shape deformation and thus allow for selecting which types of deformations are used in object alignment and comparison. In contrast to previous techniques, which required correspondence to be computed with an initial or prototype shape, modal matching utilizes a new type of finite element formulation that allows for an object's eigenmodes to be computed directly from available image information. This improved formulation provides greater generality and accuracy, and is applicable to data of any dimensionality. Correspondence results with 2-D contour and point feature data are shown, and recognition experiments with 2-D images of hand tools and airplanes are described.
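    The global-to-local ordering of eigenmodes can be illustrated with a rough analogue: eigenvectors of a Gaussian-affinity Laplacian built from a 2-D point set, sorted by eigenvalue so that low-frequency (global) modes come first. This is my own stand-in for intuition only; the paper derives the modes from a finite element formulation computed directly from image data.

```python
import numpy as np

def eigenmodes(points, sigma=1.0):
    """Eigenmodes of a Gaussian-affinity graph Laplacian over a 2-D
    point set, ordered from global (low eigenvalue) to local (high
    eigenvalue). A rough analogue of a modal description; the paper
    uses a finite element formulation instead."""
    d2 = ((points[:, None, :] - points[None, :, :]) ** 2).sum(-1)
    W = np.exp(-d2 / (2 * sigma ** 2))   # pairwise affinities
    L = np.diag(W.sum(1)) - W            # graph Laplacian
    vals, vecs = np.linalg.eigh(L)       # ascending: global modes first
    return vals, vecs
```

    The smallest eigenvalue is always (numerically) zero, with a constant eigenvector corresponding to rigid translation; discarding it and comparing the next few modes gives the kind of global-to-local shape comparison the abstract describes.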